Predicting Poll Trends Using Twitter and Multivariate Time-Series Classification

نویسندگان

  • Tom Mirowski
  • Shoumik Roychoudhury
  • Fang Zhou
  • Zoran Obradovic
چکیده

Social media outlets, such as Twitter, provide invaluable information for understanding the social and political climate surrounding particular issues. Millions of people who vary in age, social class, and political beliefs come together in conversation. However, this information poses challenges to making inferences from these tweets. Using the tweets from the 2016 U.S. Presidential campaign, one main research question is addressed in this work. That is, can accurate predictions be made detecting changes in a political candidate’s poll score trends utilizing tweets created during their campaign? The novelty of this work is that we formulate the problem as a multivariate time-series classification problem, which fits the temporal nature of tweets, rather than as a traditional attribute-based classification. Features that represent various aspects of support for (or against) a candidate are tracked on an hour-by-hour basis. Together these form multivariate time-series. One commonly used approach to this problem is based on the majority voting scheme. This method assumes the univariate time-series from different features have equal importance. To alleviate this issue a weighted shapelet transformation model is proposed. Extensive experiments on over 12 million tweets between November 2015 and January 2016 related to the four primary candidates (Bernie Sanders, Hillary Clinton, Donald Trump and Ted Cruz) indicate that the multivariate time-series approach outperforms traditional attribute-based approaches.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Predicting Top-k Trends on Twitter using Graphlets and Time Features

We introduce a novel method for predicting trends on Twitter. This new method exploits topology of studied sub-networks. It is based on a combination of graphlet spectra and so-called time features. We show experimentally that using graphlets and time features is beneficial for the accuracy of prediction.

متن کامل

A Latent Source Model for Nonparametric Time Series Classification

For classifying time series, a nearest-neighbor approach is widely used in practice with performance often competitive with or better than more elaborate methods such as neural networks, decision trees, and support vector machines. We develop theoretical justification for the effectiveness of nearest-neighbor-like classification of time series. Our guiding hypothesis is that in many application...

متن کامل

Statistical modeling of the association between pervasive precipitation anomalies in Southern Alburz and global ocean-atmospheric patterns

Precipitation patterns are influenced by many factors, such as global atmospheric circulations to name but one. Precipitation patterns in Iran have always had great fluctuations even in a smaller scale like the Alburz Mountain Range. The present research has tried to find the relationship between global atmospheric patterns and the pervasive precipitation ones in Alburz. For doing so, 17 climat...

متن کامل

Statistical modeling of the association between pervasive precipitation anomalies in Southern Alburz and global ocean-atmospheric patterns

Precipitation patterns are influenced by many factors, such as global atmospheric circulations to name but one. Precipitation patterns in Iran have always had great fluctuations even in a smaller scale like the Alburz Mountain Range. The present research has tried to find the relationship between global atmospheric patterns and the pervasive precipitation ones in Alburz. For doing so, 17 climat...

متن کامل

Extracting Strong Sentiment Trends from Twitter

Twitter is a popular real-time microblogging service that allows its users to share short pieces of information known as “tweets” (limited to 140 characters). Users write tweets to express their opinions about various topics pertaining to their daily lives. With a total 175 million users and 95 million tweets published per day (as of September 2010), Twitter serves as an ideal platform for the ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2016